An Exploration of Approaches to Integrating Neural Reranking Models in Multi-Stage Ranking Architectures

نویسندگان

  • Zhucheng Tu
  • Matt Crane
  • Royal Sequiera
  • Junchen Zhang
  • Jimmy Lin
چکیده

We explore di‚erent approaches to integrating a simple convolutional neural network (CNN) with the Lucene search engine in a multi-stage ranking architecture. Our models are trained using the PyTorch deep learning toolkit, which is implemented in C/C++ with a Python frontend. One obvious integration strategy is to expose the neural network directly as a service. For this, we use Apache Œri‰, a so‰ware framework for building scalable cross-language services. In exploring alternative architectures, we observe that once trained, the feedforward evaluation of neural networks is quite straightforward. Œerefore, we can extract the parameters of a trained CNN from PyTorch and import the model into Java, taking advantage of the Java Deeplearning4J library for feedforward evaluation. Œis has the advantage that the entire end-to-end system can be implemented in Java. As a third approach, we can extract the neural network from PyTorch and “compile” it into a C++ program that exposes a Œri‰ service. We evaluate these alternatives in terms of performance (latency and throughput) as well as ease of integration. Experiments show that feedforward evaluation of the convolutional neural network is signi€cantly slower in Java, while the performance of the compiled C++ network does not consistently beat the PyTorch implementation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Ratio-Based Efficiency Measurement for Ranking Multi-Stage Production Systems in DEA

Conventional data envelopment analysis (DEA) models are used to measure efficiency score of production systems when they are considered as black boxes and their internal relationship is ignored. This paper deals with a common special case of network systems which is called multi-stage production system and can be generalized to many organizations. A multi stage production system has some stages...

متن کامل

An Overview of Nonlinear Spectral Unmixing Methods in the Processing of Hyperspectral Data

The hyperspectral imagery provides images in hundreds of spectral bands within different wavelength regions. This technology has increasingly applied in different fields of earth sciences, such as minerals exploration, environmental monitoring, agriculture, urban science, and planetary remote sensing. However, despite the ability of these data to detect surface features, the measured spectrum i...

متن کامل

An Optimum Selection of Simulated Geological Models by Multi-Point Geostatistics and Multi-Criteria Decision-Making Approaches; a Case Study in Sungun Porphyry-Cu deposit, Iran

An accurate modeling of sophisticated geological units has a substantial impact on designing a mine extraction plan. Geostatistical simulation approaches, via defining a variogram model or incorporating a training image (TI), can tackle the construction of various geological units when a sparse pattern of drilling is available. The variogram-based techniques (derived from two-point geostatistic...

متن کامل

Selection of Sustainable Supplier for Medical Centers with Data Envelopment Analysis (DEA) & Multi-Attributed Utility Theory (MAUT) Approaches

Background and Objectives: The selection of the sustainable supplier is important for any industry. Medical centers are not an exception in this case, and selecting the best sustainable supplier is a major step towards increasing their productivity. This paper, using the Data Envelopment Analysis and then using Multi-Attributed Utility Theory as a backup approach to fix errors, attempts t...

متن کامل

Prediction of Gain in LD-CELP Using Hybrid Genetic/PSO-Neural Models

In this paper, the gain in LD-CELP speech coding algorithm is predicted using three neural models, that are equipped by genetic and particle swarm optimization (PSO) algorithms to optimize the structure and parameters of neural networks. Elman, multi-layer perceptron (MLP) and fuzzy ARTMAP are the candidate neural models. The optimized number of nodes in the first and second hidden layers of El...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1707.08275  شماره 

صفحات  -

تاریخ انتشار 2017